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High-Efficiency Cloning of Full-Length cDNA 

HIROTO OKAYAMA AND PAUL BSRO* 
Depmmtnt cf f locktmistry, Stanford UiUveniTy School ef Mtdicltu, Stanford, Cddb'mia 94305 

Receivtd 20 October 19Bl/Acc6pted t Dtcwnbcr 1981 

A widtly rMOEoized difleulty of prciently used methods for cDNA oloning is 
obtainiiig cDNA ieim6nt& thftl contam the eniire nucleotide sequence of the 
comspcmdifig mRNA. The clooing procedure described here mingfttes this, 
■hortcommg. Of. the 10* ptasmid-cDNA recombinants obtained per « of rabbit 
reticulocyte mRNA, about 10% contained a complete a- or $-globin mRNA 
eequence, wd at least 30 to 50%, but very likely mote, contained the entire globin 
coding regions. We anribute the high efflclency of cloning fuU- or nearly full^ 
length cDN A to Q the ftct that the plasmld DN A vector tteelf serves as the primer 
for first* and second-stnnd cDNA symhesis, (ii^ the lack of any nuclease 
trcatxnent of the products, and (ill) the fkct that ooe of the steps in the procedure 
results in preferential cloning of recombinanu with full-length cDNA*s over those 
with truncated cDNA^s. 



The availability of compleaentary DNA 
(cDNA) copies of mRNA's wovides an ex- 
tremely powerful tool for analyzing the struc- 
ture j organizadon* and expression m eukaryote 
genes (4. 3, 9. 10, 16, 19, 36, 38)- Aside frwn the 
utility of cDNA's for defining the initiation, 
codingi and terminatUm seqtiencefi of mRNA^s, 
their use as hybridization probes makes it possi- 
ble to searsb for» isolate, identify, and characte^ 
ize the corxesponding genes ttom chromosomal 
DNA. Indeed, It was comparisons between 
doned cDNA*s and thdr genomic counterparts 
that uncovered the existence of intervening se» 
quences (4. S, 9, 19, 38) and spUcing (37) and the 
occurrence ol genomic rearrangements in the 
formation of fiuiodonal immunoglobuUn genes 
(3), More rccenUy, the cloning of cDNA copies 
of RNA vims genomes (e.g* , vesicular stomatitis 
[321. polio [22] p and influenza vimses [35]) has 
opened the way to a more refined understanding 
of these viruses' structure, replication, and 
expressioa» as well as providing a simpler route 
for the developmeni of antiviral vaccines (17). 

With present techniques for cDNA cloning 
(12, 31, 33, 42) the yidd of recombinant DNAs 
that have full-length oDftA sequences, i.e., 
cDNA*i containing the en the nudeodde se* 
quence of the mature mRNA, is low, QeneraKy> 
most cDNA clones contain 3 ' -untranslated and 
variable amounts of the protein-coding se- 
quence; obtaining cloned cDNA*s with complete 
5'-untransl«kad and protein-codrng sequences Is 
rarer> purtlcularly if the xnRNA codes for a large 
protein (32» 35). Although such tmncatcd 
cDNA*s are still useful as hybridization wobes 
(IS), they cannot direct the synthesis of com- 
plete proteins after their introduction mto bacte- 



rial or mammalian cells via appropriate expres* 
sion vectors. Consequently, we have sought to 
devise a cDNA cloning method thai increases 
the probability of obtsMng recombinanu with 
ftin-length cDNA Inserts. 

Presently, the Initial cDNA copy of a mRNA 
is synthesized with reverse transcriptase (39) 
using as primer either oHgodeoxythymidylatc 
[oligo(dT)] annealed to the polyadraylate [po* 
ly(A)l taU (U, 18) or an oligonucleotide annealed 
to a complementary sequence in the body of the 
mRNA (36» 40). The quality of the reverse 
transcriptase, the integrity and secondaxy struc- 
ture of the mRNA, and the raaction conditions 
influence the length of the primary reverse tran- 
script and, therrfore, the compbteness of the 
subsequently oloned cDNA. Second^strand syn- 
thesis is at best a poorly controlled step, since it 
relies on the abOiiy of Escherichia coli DNA 
polymerase I {Poll) to use the initial reverse 
transcript as both a primer and template, the ciid 
result being a hairpin double-stranded DNA with 
the 5' end of the mRNA sequence in the form of 
a single-strtnded loop of variable size and loca? 
tion (11). SI nuclease digestion of the single- 
stranded DNA loop, which must precede the 
addition of homopolymeric tails to the ends of 
the cDNA, invanabfy removes portions of (he 
cDNA correspondlns to coding or S'-proximal 
portions of we osRNA. Sevcrai innovations, 
such as enrichment for the particular mRNA 
sequence prior to cDNA synthesis (S), fraction- 
ation of sinde-strand or double-strand cDNA to 
enrich for particular size classes (6, 10, 36), or 
even alternative procedures for priming the syd^ 
thesis of the second strand that eliminate the 
need for the nudeasc digestion of S'-proximal 




scqueoces (24, 33)i hat improved the yield of 
ftiU- or nearly ftiU-lenfitb &DNA or at least 
produced cDNA'» wbfa mtaci j ^proximal mide* 
Qtade sequencei (24). 

Hera we deieribe 4 modification in the cDNA 
cUmloE procedive that permits the rteoveryf In 
hieh yields. qC plaimld recQmbinants with fiill- 
or nearly fuQ-leiatth cDNA inscrt$. The proee* 
dure use$ % plaimid DNA vector which itself 
fiervei u the primer for first- and ultimately 
second-Btrand cDNA symhem; moreoveri one 
of the sU^i ia desiined to enrich lar recombi* 
aanti containing fixU-lenfth cDNA'ft over those 
iKiCh tnincated cDNA's. Tbit procedure hu been 
applied lUcceBSfiiUy to the dontng of fiill4ength 
a- and 3-|lobln cDNA*s ftom rabbit reticulooyte 
mRNAi 

MATERIALS AND METHODS 

Cbenicali, cmyiMe, and plasmfdi. OBgD(dT)* aad 
ollgodeoxyadenyhte [c>liBQ(dA)]*^ltaileiei (both type 
S) w«r« purchaaed from Collabontlve Roaearah Inc. 
The revorte mfiscriptaie was the ariaii oyelobtafto* 
sis vin}i enzyme obtataied torn J. Beard at the Nadon- 
al iMtlnitei of Healch. S. etUi DNA Uaate was provid- 
ed by I. R. Lehnian, eoU DNA polymmte I was 
provided by J. Widom and S. Shtrir« and Eco91 
endooudcase came from J. Carlion (all from Stanfoi^ 
Uxiivenity). Calf fchyimia terminal deoxynudeotidyl 
transftraae and £. eoHRNaie H ware purcbaMd from 
PL Biochemieal Co, ; AKmini and Kphl endonueleaaei 
were products of Bethecda Reicarch LaborazDrios 
Inc., Hpal. Avail, and i*virll endonucleaiei ware 
from New Bn|lAnd BioLabs, and PsA endomitleaai 
was obtained from BoehrinBcr-Manaikeim. The 
pBR32Z-aV40 recombinaittfi that provide the vector^ 
pmncr DNA and linker DNA ftaimant (Fie* D were 
conatzuctad by Subnunafd. One centaioed a simian 
virus 40 (S V4Cf) DNA aaament coTFoapondlng to m^ 
twsition 0.71 to O.M donad between the ^uU and 
HinSai litei of pBB322 DNA, and the other had & 
segment from map poutioo 0.19 to 0.32 inaened be> 
tweeo the BamM and ffMm siteft Of tba plaamid 
DNA. 

P i ep a r a tiao of ndtNA, Rabbii Tsticulocyte mRNA, 

earichad with a-glbbin aRNA» was prepared from a 
postpdysoflul fupematant of a retieelocyte lysate 
obtained from phenylhydraziai-created rabbitf (28). 
The mRNA was reeoveied after phenol extraction by 
alcohol pitCQritatiop following two cycles of adsorp- 
tion and dution from an dU|eo(d7>ecUulo8e column 
(1). In recent atlenptB to prepare cDNA Ubtvias from 
other edh (H. Obtyama and P. Ber|, UnpuMtihed 
data) we bav« found the guaAidininm thiocyanate 
method (7) to be superior t6r the praparazion of mRNA 
that ii readOy revene transcribed. 

Pr^antian of tartar prteat $MA ni^ d&tallad 
Itakar DNAi« (For a diagram of Ulti proceduir, aee^. 
1), A 400-H« sampia tf pBB322*SV4Q (m^ units 0.71- 
0.86) DNA was dieesM at 37t: with 700 U Of KpM 
endonndeaia in a reaction mixture (0.4 ml) containing 
6 mM Trift-hydrochloride (pH 7.5), fi mM MgClj, 6 
mhl NaQ« 6 mM 2-inereaptoeihanol, and O.I mg of 
bovina sarum albumin (BSA) per ml. After 5 hi the 
dieaitioA was terminated with 40 of 0.25 M BDTA 
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FIG. 1. Preparation of plftsmid primer and linker 
DNA^ The unshadad ponion of each ling is pBIU22 
DNA, and cbe ihaded or stippled legments are from 
SV40 DNA, Tho numbers next to the reBtriotion fitte 
designations ate the correaponding SV40 DNA map 
coordhtates. 



(pH 8.0) and 20 pi of 20% sodium dodeeyl sulbte 
(SDS); the DNA wai recovered after extraction with 
waterwuraied phenQt-CHQ] 0:1) (hereafter referred 
to a« pheool-CHClO and eth&nol precipitation. Homo* 
polymer tails averaging <iQ, but not more than 80, 
deoxyihymidylau (dT) residues per ead were add- 
ed to the Kp^il endonuclease-generaicd nfmini with 
calf thymus tennlnal daoxynucleotidyl cransftoase as 
follows. The reaction milxture (0.2 ml] contained 140 
mM sodium cacodylate-30 mM Tris-hydrochlorlde 
(pH 6.8) as bu&r, with 1 mM Coda, 0.1 mM dithiotb* 
reltol^ 0.2S mM dTTP, the endonncleaae-dl^ftt- 
ed DNA, and 400 U of the lerminai deoxynudeolidyl 
transferase. Alter 30 rain at 37T the rpaciion was 
Itopped with 20 of 0.25 M BDt A (pH S.Q) and 10 
of 10% SOS, and the DNA was recovered after several 
excractlont with phenoi-CHQ) by etbanol prsQipiU" 
tion. The DNA was then Rested with 17 u or Mpai 

andODuclease in 0.2 ml containing 10 mM Tris-hydro- 
chloride (pH 7.4), 10 mM MgO,, 20 mM KQ, 1 mM 
diiUothrdtol, and 0.1 mg of BSA per ml for 5 h ai 

37^. Hie large DNA fr^ent, which contained the 
origin of pBR3Z2 DNA repUoailon and ttte gepe con- 
ferring axnpiciUin radstanoe, wai purified by agamse 
<196) ia! eleetrophortftift and recovered froth the gel by 
^ raodiftcadon of (he ghui powder method (41). The 
dT-tatted DNA was ftxrther purttlad by adsenition and 
eUuion from an oligo(d A)'<ca>|ulose column ai fbllow». 
The UNA was diiBolved tn ; ml of 10 mM Tris- 
hydrochlorida (pH 7.3) buffer contaifling 1 mM BDTA 
and 1 M NaQ, cooled to fSfC, and ^ypUed to an oKgo 
(dA)-calhilou column (0.6 by 2.5 cm) equilibrated 
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with the sunt bate Bt 0*C. Th« cduJik "^^^ 
with ttw am» bote at 0»C and cloud with witer tf 
room tempentore. Tto «l«tf* DNA 2^ J* J^"* 
D»eipltatid with •thuol ud disiolyed h lOQ jiltf 10 
mMTttrtydiochlotlde tp^ i ™ 

Ttio dUiodeoxyKtiaiiyltfo [oM^COVunod bsjm 

DNA P«W«d by djt^^ ^„^^?ffi2: 
S V40 (map units with 120 U of Ad endontt- 

dMM in 0.2 fid eonttlBitii ( mM Tris^ydrpcUorlde 
(pH 7,4), 6 mM M|Ck> 6 mM 24iierc«|»oMhinol, SO 
mM NaQ, nd 0.1 m of BSA per od. Afur l.S ta at 
37T the mcdon mixture was iXtrteCtd with phenol- 
CHCb and iba DNA was prec^tatfid with lUtt^. 
Tails of 10 10 IS dMxygaanyiatf (dG) rtlidueB 
weft then added p«r end wifh «) U al Mrndnal 
deoxyattclaatldyl tiusfbnae in the same reaatukn mx*. 
tun (SO ul) deieribtd above* excmt to 0.1 bAI dOOV 
twlaciaa dTTP. After 20 min It SrC the mixtnie was 

axtnctad with phenoWCKCl»« thd after the DNA was 
ptedpiuted with othaaol it was d^tad wbb 50 U of 

JEfirtdni endonaclease in 50 |d eontaioink 20 mM Trit- 
hydxtwhloride (pH 7-4), 7 mM MgQa, 60 mM Naa 
and 0.1 of BSA at 37*C for 1 h. The snwB oU^o 
(dO)'taUed iinlw DNA wai purified by woso |d 
a.B96) eltctrophotecLS and rieovered as deacatooo 

V^eparatiatt and Juulan ef glshb HDfUL For a 
diagnui of the preparation and cloninf of ^Mn 
cDNA, aaa Plo, 2, 

Stap I. eDNAiT^liesls. Tlic reaction mixnire (10 Mi) 
COtttJned 50 mM Tiis-hydrochloride (pH S,3). I mM 
MftCli, 30 itiM Ka> 0.1 mM dithlochrcitolp 2 mM aach 
dATP, dTTP, dGTPi and I^PWCt? (ISO cpm/pmol), 
0.2 M of the reticulocyte mRNA (1 pnol of eioUa 
mRNA), 1,4 M of the vocterprimer DNA (0.7 pmol of 
primer end)i and S U of revarse tt«i»eriptasa, [Ths 
molar ndo of polyCA)*^ mRNA to vcctof^primer DN A 
should be ireater than l.Q» and in our experimenti 



taaiedtem 1 .5 to 1). cDNA tyn^j^ was hdtiaiedby 
the addition of reverse transeriptase and eontfamed at 
irC for 20 min. By lUi time the rttc of dCTP 
iDCOfpontion had levded off and more than tiOK of the 
Planar was tuflized fiir cDNA synthesis. The reaction 
wu Slopped with 1 |U of 0*25 M BDTA (pH 840 and 
0,5 fd of 10ft BD$; 10 |J of phenol-(3ICl» was addad. 
and the solucioa wai blended vigorously in a Vonax 
mixer and then centrifUfed. After addioi 10 »J of 4 M 

ammonium aoatate and 40 iiU of ethandi to the aqueous 
phase, the aolution wai ehtUed with dry lee fbr 15 mtn, 
warned to room temperaiura with centfe shalcioi to 
cUiiolve unreacted daoxynuclaoslde tr^osphaxes 
that had predphated during chilUnii and eentri&iged 
for 10 min In an Eppendorf microAJiek Tl»a pdlet was 
dissolved in 10 |d of 10 mM Trls-hydrocUoridp (pH 
7.3) and 1 mM SDTAi mixed wich 10 mI of 4 M 
amffloninm aoetatSf and reprecipitated with 40 Ml of 
ethanol, a proeeduia which removei mora than 99% of 
oanaeted deoxynudeoside tripbospbOWfe, Ihe pe&at 
was rinsed with ethand. 

8tiV %l OUgedaexyqrtldylais [flllfD(dC)I addttiim. 
The pdiet eentaining the plaamid'6DNA:niRNA was 
dissolved In 15 ^ of 140 mM sodium eaeodytete-30 
mM Tris-hydrodilcride (pH €.S) buft^ containing 1 
mM CoCb, 0.1 mM dithiothreitd, 0.2 w of pohrtA), 
» mM pP]dCrP (6,000 cpm(pffld)p and 18 U ef 
termind deoxynudsicMidyl trvnsfsrase, The loaction 
was cairied out at 37*0 fterS min lo pamit the addition 
of 10 to IS reaidues of dCMP_per end and then 
tenniaated with 1.5 ^ of 0.25 M EDTA (pH 8.0) and 
0.75 )J of 10% SDS. After excractiim with 15 mI ef 
phcodCHCi}, the aqneotts phase was mixed with 15 
m1 of 4 M ammonium aoetatei the DNA was precipitate 
ed and reprecipltaied with 60 fd of ethand, and the 
find pellet was rinsed with ethand. 

Step 3t AMm cadenndaaae dlgeBtfan. The pellet 
was dissolved in 10 m1 of buBbr containing 20 mM Tris- 
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FIQ. 2. Steps in the construction of piasmld-cDNA recombioanta. The designations f6r the DNA segments 
are as mentioned hi Fig. 1. Bxpertmentd dataill ind comaienls on the procedure sre presented in Materials and 
Methods and Rasuks, respectively. 



hydroOloridi (pH 7,4), 7 bM MgCIa, fiA NtQ, 
udO.lmiofBSAp6rtidinltteadiiiitidwithZ.su 

of tftedin endcmQclmi for 1 h at STC. 
w&s twiniiitM with 1 Mi of 0*25 M EDTA (pH S.(9 ai^ 
0.S M of IM SDS, tad, after axtractton witii phend* 
CRCU fbUowed by the adfitiop of 10 »U of 4 M 
anunoDium acautc, the DNA wtn pradpitatod with 
40 (J of cthand. The peDflt wu rinaad with athaoel 
and than diiulved In 10 ul of 10 mM Tris'hydroehlo-- 
rida (1^ 7 J) and 1 ipM BDT aDd 3 Hi of ath ml wai 

added to pi«vant teefing duriag st£vige ai -20*C, 

Slap 41 CydMlta nadWad by thi «||||d(dO)-talkd 
ti^MT DNA. A l-iJ taiaple of tba HifiOu andosude- 
&«#.digested olinKdC)*Uiled cDNA^mKNA planrdd 
(0,02 pmctD wiatoQubatad in a ndictun 
ins 10 mM Triv^iydroohloTida (pH 7,5), 1 nM E»TA, 
01 M KaO, and 0.04 pmol of the 6yto(dG}-tsUad 
linktr fiNA (Oda ammnl is a twofold molar oxeaai 
over the qliaAticy Of the yactor-cDNAtoiKNA and of 
the fkagmant wmch rafnaiiu as a mutt of the HMStL 
andoBudeaac digeitioo in the privious nap) at fiS^C 
for 1 min, ihiftod to 42rCfor 30 fflin. and fhen cooled 
to 0*C. The mUlura CIO »U) was a^uited to a volume of 
lOO uj oontpiniBB 20 mM Tria4sydn)ehletide (pH 7.5), 
4raMM|a» 10inM(MH4}aSO4, 0.lMKa.SOMof 
BSA per ml, and 04 tnM p-NAD; 0.6 fi« of £. coif 
DNA liiaaa was added, and tna lohition was thon 
incubated ovemightat IVC. 

tep 8i Bapiaarnant ar BNA scrand by DNA. To 
ro^aoa the KNA itrand ef the interti the lii^tion 
DilxhM was adjusted to eontalA 40 of each of (he 
four daoieyouGleotide triphoiphatei* O.IS mM 9-NAD, 
0.4 M <>f additional E, coli DNA Ugase, 0.3 ufctE. 
coU DNA polymeme I, and 1 U of £. co/T XNasa R. 
This mixture (104 >!} wai ineubaled aocoessivaly at 
12*C and room tempaiMire fbr 1 h each to promote 
Qptifnal repair synthesis and nick uanslatioD by Fell 
(23). The raactlon was tenninatcd fay the addition of 
0.9 ml of eold 10 oiM Trifr-bydroehlorida (pH 7.3), and 
0.1-ml aUquot^ were itored al VC. 

Step i: imslMnnntlen af £* tot. TtnnsfbnnadoD 
waa carried out uiiog minor modifteations of the 
proeedttva deacrlbad by Cohen oc al. (B)i £i eoU k-12 
(strain HBlOl) was grown to 0.5 absorbsney unit at 
600 nm at 37*C in 20 nd of L-brodi. The aaOs wcf« 
eoUected by centrifuniian, su^eAdad in 10 Od of 10 
mM Tfls-bydrocMotide (pH 7.3) containing SO mM 
CaCIs* and oantdfufiad at 0*C for 5 min . The coOs ware 
rcfuspended in 2 ml of the above buffer and ineubaiad 
agdn at O^D ftr 5 mk; then, 0^ ml of the cell 
suapanslons was mixed with 0.1 ml of the DNA 
sohition (Sttp 5) and incubated at 0*C for 15 mln* Aiter 
the eellfi were kept at 37* fbr 2 mio and at room 
teiqperatuie for 10 min, 0,5 ml of L-broth was added, 
and tha euHure was ineubaeed at 37*C for 30 mia, 
mixed with 2*5 oil of L^broth soft agar at 42*C^. and 
spread over L>'broth agar containing 50 of ampicil- 
lih per ml. After inmibatkm at 37% fbr 12 to 3i h, 
iadLYidutH cciameB were picked with Sterile tooth- 
picks* 

CharaaarfeBlta ol tDNA cloaac. 00 Cetooy InrbrM- 
tiatton and iaelallen eC rewitdnant plasnddc. E, ccH 
ttansfonnanu were screened tor the pxesan&e of glo> 
bin cDNA by in situ ctAmy hybridixatkm (15). One 
hundred tnnstemanu wai« grown on (brae replica 
nitroQaltulosc Star diika, lyscd with alkali, and hy- 
bridized with [^PI*obNA syntbaiiaad by oligD(dT)- 



primed reverse tiansci^tion of rennbeyte mRNA. 
AUematively, "^adek^translated a-g|obin or>«lobtn 
cDNA clonea, which had been re«peetiv«ly construct- 
ed and id'entified in the pccsont work xn prepared 
previously (29) from pBOl (10), wtn the globin cDNA 
probes; each itf the cDKA'a waa removed from the 
vector by lastriction entymo dlgosUoos md isolated 
by agarosa gal elcctnxphoreaii bcAizc ose as hybridiza- 
tion probei. Colonial that gave positive hybridisation 
slgwds were grown in L broth containing 50 »Lg of 
ampicillin per mh und their plasmid DN As were isolat- 
ed by standard taclmiques (21). 

m lUatricden m^dng. DNAi wcit digeaud With 
£coRIi fitlt and ^iill, or Avon restriction andomi- 
eleases under eondltions recommended by die suppli- 
ers, and analyzed by agaroae gel (l.Sft) electrophore- 
lil. 

(ISS) DNA leqiiaiefaigi DN As were digested with fill 
endoRucleaae, ineubtted with £. co2f alkaline phai« 
phatasa to remove the terminal ptmpbates, and cenni- 
naUy labeled with I^^PIATP and polynucleotide ki- 
nase (26). After digesdon with fcoRl andottuclease, 
the "^-labeled fragment containing the 5' end of the 
^obin cDNA was pUiiflod by agaroM gel (1.5%) 
electrophortsii &nd sequenced by the method of 
Maxam and Gilbert (26). 

RESULTS 

Expeiimcntal details of the procedure for pre- 

Suing and cloning cDNA^s are presented above, 
ere wa consider the r^oimln and several gen- 
eral features of the method aotf illustrate its 
application to the cloning of fun« and nearly fidl- 
lengtb O"* and 0-globin cDNA's from rabbit retic- 
ulocyte mRNA. 

Key features of the protocol (outlined in Fig. 
2) arc that (0 the plaamid vector DNA fitnctions 
as the primer for the synthesis of the first cDN A 
strand, an innovation first introduced by Rab- 
bitU (31); (U) the fuU^ or nearly foU-len(th re- 
verse transcripts of the mRNA are preferentially 
convened to duplex cDNA's anid cloned as 
recombinants in £, eoH in the subsequent steps; 
and (iil) niok-tnnsbtion repair of the cDNA; 
mRNA hybrid, mediated by E, coti RNaae H, E, 
coU Poll, and £. caU DNA ligase, Is used to 
ayntfaeaize the second cDNA strand. 

The vector-primer, a Imcar DNA with » 
poly(dT) tail at one eiKl» was constructed as 
outlined In Pig. 1. The ^R322-SV40 (map units 
0.71-0.86) DNA recombinant was a convenient 
precursor because it contains both a unique 
restriction site (Jl^nl). at which 3' sbgle-strand 
ends can be generated for the elSicient attach- 
ment of poly(dT) tails, ^d a second uniqiie 
restriction site {Hpcl) near one end to perfhlt 
lemoval of one of the poly(dT) tails. This parclc« 
ular DNA was usaftil (6r the pui^se described, . 
but othar DNAs whb aiinilar axrangeaients of 
appropriate restriction sites could be substitut- 
ed. 

Annealing of the polyCA)*** mRNA to the 
poly(dT)-tailed vector-primer DNA generates 
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(he sabstxvte for revenc tnnsciiption of the 
mRNA sequtnct, A l.S- to 3-fidd molir excess 
of mRNA over poly(dTMiiled DNA is edvafita- 
geoui to minimize the tioiiibiUty thrt asewted 
polyMT) tails of the yecto^primer DNA uiU be 
an acceptor for otlgo(dC) tedls in the mxi step. 
Under the oenditions osedi more than 60% of 
the vectar-i»rimer DNA aoquired a oovaienily 
link«d reverse tnnicript of (he mRNA durine 
the first 10 min of incubation (deu not shown); 
moreover, from pilot experiments with a 
poly(dT) primer and ("FldCTP as one of the 
deoxynudeoside trijihosphites, at least SO to 
60% of the cDNA copies attained the length cf 
the glc^in mRNA. 

Tht aim Of the next step is to generate a 
cohesive tail at the end of the cDNA so that it 
can be ligated to the oihar end of the vector 
DNa and. thercbyi provide the template for 
second-strand cDNA synthesis. This has been 
achieved by adding 0li80(dC) tafls to the 3' ends 
of both the cDNA and vector DNA and removal 
of the oUgo(dC) tail from the vector DNA termi- 
nus by deavaga ai the unique Hindm restriction 
site near that end. Since the mRNArcJDNA 
hybrid Is a very poor substrate for HifiHU 
endoouclease (27), it remains iniact during di- 
gestion with limiting quantltiofi of the restriction 
enzyme. 

The ensuing vcctor>cDNA:roRNA derivative, 
with a HiniOXL cohesive end and an oUgo(dC) tail 
at the respective termini, is cycltzed by E, coli 
DNA ligasc uaing a short linker DNA segment 
containing a HinAVS. icobcsivc cad apd an oligo 
(dO) tail. Covalent Joining of the linker and 
vector DNAs occurs via their HinillL cohesive 
ends, and a noncervalent, bast^^iiixed join ii 
made to the cDNArmRNA duplex via the oli- 
go(dO) and oUgo(dC) tails. In pracUce about 
20% of the linear vectoi'*cDNA:ndtNA was con* 
verted to circular structures as judged by the 
electrophoffotic shift in agarose gel. 

Since tranafidrmaiion of coll (HBlOl) with 
recombinants ccmtaintng eDNA:mRNA Inserts 
is inefficient and yields cDN A dones with vari- 
able but extensive ddetions in the insczts (43), 
we replaced the mRNA strand by the cozrc- 
sponding DNA sixwd in vitro. This is accom* 
plishcd by using £. coli RNase H to introduce 
ucks in the RNA strand (25), E. coli Poll tnd 
the four deoxynudeoside triphosphates to re- 
place the RNA segments by nick translation 
(23), and coli DNA ligase to Join the iie>vly 
synthesized DNA fiwnents into a continuous 
second cDNA sirahd. In the repair synthesis, 
the oligo(dQ) tail of the linker DNA serves as the 
pnmar for eopying any unpaired deoxyribosyl- 
cytidiae (dC) sequence and cxiendiDg the strand 

to the cDNA region. Similaity, If necessary, ihe 
coli Poa extends the oIjgo(dC) and p^(A) 




tnilft to produce complete pairing with the oli- 
go(da) and poMdT) of their respective opposite 
strands. E. coU DNA ligase was chosen in pkcc 
of the T4 enzyme because of its inability to join 
adiiacent RNA and DNA segments (13) that arise 
during second-strand cDNA synthesis. 

Applying this procedure to rabbit reticulocyte 
mRNA produced about 10^ amincillin-resistant 
HBlOl clones per m of starting mRNA (Table 
1)» PaQure to bridge or ligate the ends of the 
cDNA and vector by omission of ihe linker 
DNA segment or the DNA ligaie resulted in 
drastic reductions in the number of bacterial 
transfonnantSr If the mRNA stxind was not 
xoplaced by the second strand of cDNA. the 
number, of bacterial transfonnants was reducod 
fivefold. 

Characterization of a> and B-globin cDNA 
dones. To estimate the yield aqd identify the 
clones cooteinh« a- and p-globin cDNA. colony 
hymldizations (IS) were performed on 100 ran- 
domly chosen baetBrial transfonnants (Pig. 3), 
Figure 3A» and C show that about 80% of the 
traosfbrmanti contained recombinant plasmids 
with nucleotide sequences homologous to total 
reticulocyte cDNA, and 85% of these were 
accounted for by a-globin (50%) or p-gloWn 
(3S%) cDNA derivatives, Presum^ly, the 
dones with cDNA bserts Chat failed to hybrid- 
ize with a. or &*Elobln cDNA contain nondobin 
cDNA's. 

A group of clones containing 21 n- or 12 B- 
giobin cDNA*s were analyzed further by restric- 
tion enzyme digestions to determine the size of 
the cDNA Inserts . The diagram in Fig. 4 summa^ 
rfzes the expected fragments based on the 
known restriction sites in the recombinant with a 
complete a-glpbin cDNA segment. Figure 5 pre- 
sents the same infonnation for recombinants 
with a complete p-globin cDNA segment, EcoSd 
endonudease dtjscstion of the a* and B-globin 
cDNA recombinant DNAs reveded that 10 of 21 
tt-globln and 4 of 12 p<globin cDNA dones gave 
the expected sized firagment (shown by the posi- 
tion of the arrow) for a complete cDNA copy 



TABLE X. Yidd of ampicimiv-rvilsiant 
transfeimaflts obtained by cDNA dodos proc edure 

Proeadurv T^voiemuuiu 

" ^ too , per binJtK A} 

V 'n-l'"r ■ » IW-OOO 

- Obeo(dC)-tsiled : 

l^^ipDNA..,. 2.400 

-DNAKgass,.... ^''^JJ 

- DNA polymcrais 

sad RNasc H n qoq 

b,'a^^^l\S!^ to. th» procedure as described 
whKh the indicated component or step waa omitted. ' 
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FID. 3, In iitu colony hybridkatioB to detect cDNA elones. One tnindrtd independrai ampiciQia-mlstant 
tnmsfbcniaitti were pIcM and grown on three TcpKca nbmeelbilofte filidrc. Detection of cDNA donee wks 
acccMBpIUIiBd by colony hybcidizatUn CIS) uiliig thnt diBratprobci; (A) total cDNA synthesized by rcvene 
tnntcripKkmofthe rabbit reticulocyte mRNA usedterckmlog; (B) cloned nbbit a-tfobiaeDN A prepared in this 
work and identified by resttictiofi aaalyies; (C) cloned p^-i^bin cDNA made by Eftiratiadis ct al. (IQ) and 

radoned by MuUigaa ^ tl. (29). 



Each of theee plaamU DNAa (10 for a-^bln 
and 4 for ^fl^bin) wae dlfdited wifh All endo- 
naekaae to cleave at the reeonstrueeed Pstl 
restriction site a4)aGent to the oU£o(dG!dO Join 
and with PvuU mdanucleae e to dme tiie vec- 
tor DMA aequence axyacent to the poly(dA:dT) 
join, AS bat 2 of the 10 elonoa, judged to contain 
naaity {taU-lansdb a-ilobin cDNA by the EcaSIl 
CAdDnueleaaC dtgeitioki, contained theae two re- 
striction Bttoi and yielded the predicted leneth 
frBgmentfroQi a double digeat; aloularly, 3 of the 
4 0^gbMn cDNA clones^ judged to be nearly ftiU 
length by EcoSll endpnucleaae denvage, ofo- 
dnced the expected sized CIrtgmenti after Pstl- 
PvuU endonudcaae digeations* The fuzzy and 
hetoxogeneooa bands leen in the Pstl and PviM 
endonucieaee digeats of u-glDbiQ cDNA clone 2 
and ^tf obio cISN A clone 12 are probably due to 
haterogenehy in the poly(dA:dT) tegmern that 
arises during xbtir propagation in £. coh*. The 
eighi putative oosnplete a-j^obin cDNA clones 
also contain the two known AvaQ restriction 



sitei, spaced to give the expected sized fragment 
(Pig. 4). 

The S''t>roxlmat nucleotide sequences of the 
eight a- and ihree p-globin cDNA segments 
were dcterroiaed to establish the completeness 
of their cDNA's. After ^ labeling at the recon- 
struetad Pstl icstriction sites adjacent to the 
QUgo(dO:dC) joiUi about 120 nucleotides in each 
cDNA were sequenced by the Mflxam-Oilberf 
method (26) (Fig. 6 and 7). Each figure shows 
the detetmined cDNA sequence acUaoeni to the 
linker oligo(dO) segment and the known 5'- 
terminal nucleotide aequence of the correspoad-' 
Ing globin mRNA., The nucleotide aequeAces 
distal to those shown here ware ideotical to the 
already reported sequence for rabbit (IS) and 
^-globin (1(0 cDNA's. 

Two of the eight a-globin cDNA^s contained 
all the nucleotides at the 5' end of the mKNA 
sequence, preceded by 14 or 26 dO residues of 
the oligb(dG;dC) linker. The other six lacked the 
first nucleotide at the S' end of the mRNA 
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FIO. i. 9»uMn o^na^inM UUOyMI of a-ffMa tfDNA cknci. Flaimid UNA Keaaitd tram H 

jo^Mana with a-M cDKA iniem were digwud wtch Ecotl (iippeSrSl ami A«3? ixl-n 



sequnca ud had oUgoCdOidC) linken raafini 
u length from 15 to 21 nsUuM. QftbeihrM B- 
0obiii gDNA's that were •«qu«iiced, one cofl> 
tamed tin MtiK 5' end of ih« RiRNA sequence, 
one iKtod one nucleotidti and die other Indeed 
three nudeotidei. Tltk linker segnwata In the B- 
l^obin cDNA clooef wen 14, 16, «nd 30 nudeo- 
tidei in length, but the latter two eontaioed, in 
addition to dG, the unexpected nucleotides dC 




and Jfr. Tbeae were probably Inadveiteatly in- 

SJifn^ -'fSS'^ mcomplete removal of 
dOTP and dATP after the revene transcription 
and before the additioa of the oUgo(dC) taile to 
the end of the cDNA atraad, 

Asiuining that the cDNA's analyzed are a 
representative sample of the cloned cDNA pop- 
ulation, we estimate that 30 to 50% of die cDNA 
segments are M- or nearly fldl^Ungth copies of 
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FIG. 6. C<^iiipftEison of 5'-tOTniiiAl nueUotide &e- 
qucncev of t lyht putative fgU»lin|di a-glflbin cDNA'i 
with tto knowa S^tenuiDil ■♦qutnee of oitfaHa 
mRNA (II). 



thtir mRNAU. Quiet liktly, stm more of tht 
globin elonei contain w oatiTe codmg sequenee 
ifaice the initiator AUOs for a-^bin and 
globin are 36 (18) and 53 (10) mielactidtii re- 
spectively, from the S'-eappcd nucleotide 

DISCUSSION 



ittandi baae^pairad to lonser mRNA*t (34); OiO 
RaplaocawQt of the mRNA by « lacond cDNA 
stzand ii effected by fha actions of £. coif RNaaa 
H (to produce nicks and |api in the RKA 
strand)* £< coll Pofl-medUted nick translation 
(to rqpUee RNA with DMA sequancas), and E. 
coll DNA ligaae (to join the DN A fraBmmts into 
iha coAtbuooB second cDNA stnnd). 

In tins paper the procedure lia^ been used to 
clone fliU-lragth rabbit a- and 0-globin cDNA. 
However* the high cloning efficiancy makes it 
feasible to isolate rare as wall aa abundant 
cDNA*s from a variety of cellular euRNA popu« 
lations. For example, a cDNA library containing 
2 X 10^ dones per m of mRNA, made with 
mRNA from Chinese hamster cells, contained 
severs! clones that code for the BS^OOO-dalton 
subunit of adenine phosphoribosyl transferase, 
one of which contained a cDNA segment of 0.93 
The cDNA cbnini method described here is kilobases (kb) and is cuirendy being sequenced 
general, being epplicable to purified or mixed ^ determine its structure (R. Axel, private com- 
mRNA, to poly(A)^ viral RNAa, or to ceUular munication). An analogous cDNA library from 
RNAs to which poIy(A) tails can be added at the hamster cells (A29) that synthesize a metfaotrex- 
3' end, The procedure is relatively rapid, taking ^-resistant dlhydrofolate reductase (14) con^ 
less than 2 dl^s from reverse transcription to the cDNA clones whose lengths are sufficient 
collection of cransformants containing the to code for that enzyme (M. Diatkmann, 
cloned cDNA's. Moreover, it uses commercially Okayama, mi P. bers, unpublished data). In 
available reaients and enzymes and standard both of these cases the relevant mRNA's are 
biochemical procedures. Aside from the relative estimated to be present in sliout S u lOD copies 
ease and rapidity^ the most significant advantage cell^ M. Ballivet, J. M. Lee, h Patrick, and 
of this procedure, relatjvo zo those previously S. P. Heinsmann have informed us (private 
used, is the hi^ efficiency of cloning fhll- or communication) that, using this procedure, they 
nearly fuU-tenglh cDNA's. Applying the proce- b&ve isolated fiin- or nearly fiill-sized cDNA 
dure to rabbyi ^culocyte mRNA yielded ap- doues that code for the 60,000-dalton subunit of 
ptcximately lo' globin cDNA dones per m of Torpedo californica acetyl choline recepior 
mRNA, of which 10% had the complete mRNA (20): moreover, Shigasada and Stark (private 
sequence St the S' end and 20 to 40% had all but communication) have obtained a cDNA clone of 
two or three at the S'-most nudeotldes of the about 6.5 kb fh)m call RNA containing the 7.9- 
globin mRNA sequences. As antldpated, incor- Us CAD messenger (30). The eSdent produc- 
poration of one of the complete a-^Un d)NA tion and nudeotide sequencing of neariy ftill- 
segments into the vector SVGT5 (29) yielded length cloned cDNA's has also f^iUiaiad the 
SV40-like late mRNA's containhig the o-globin characterization of normal and abetrant spUdng 
cDNA sequence and immunoprec^table a-gl^ patterns in thdk- corresponding mRNA's (S. C. 
bin protein (Okayama and Berg, unpublished Clark, M. C. Nguyen*Huu, H. M. Goodman, 
data). and P. Berg, unpublished dat^. Additional stud- 

Although each of the factors contributing to establish the efficiency of this cloning 

the ralatlvdy high cloning eflUiency of mil- 
length cDNA's has not been My evaluated* 

much of the success, we believe, can be ascribed ^tttoit^ ,pna 

to the following ftamrss. (D Priming by the 

plasmidDNA far the synthesU of the first cDNA '^"^ »'Wc«uuscuuuusacacaa-— 

strand achieves the first join of the cDNA to the w \ 

vector, (ii) Seloctiw doniog of complete or — ' — — --^--^^^mm^mmmmm'— 

neariy complete cDNA copies ofthc mRNA' sis 

necessary because the dlgo (dC) tails needed to " 

effect ring dosure and complete second-strand w 

cDNA synttiosis are synthesized more efficient' * ^^^^-rrmmm^mmmfm"" 

ly by terminal tnnrfense with con- no. 7. Cami»hu» nf 3'.T.nninil saquancss of 

tauung base-palred, fliQ-leoech reverse tran- three puiadvt iS-tagth ^^gtobin rt^i^i^ 

Bcnpts than with shortened or truncatM cDNA known 5 -icrmfaial sequsaee of ^glotaln mRNA (10), 



^ 

J 



Vol. 2, 

procadu 
co^esc 
Ours 
cDNA^a 
limited 

cDNA*3 
(6, 10. 3: 



(16) _ 
cDNA 
cDNA's 
quenoe 

idcntifioi 
cDNA^ ( 

ftinetidn 
the vcctf 
ploring 1 
canexpr 
bacterial 



Thii mi 
OM>lSaS5 

National C 
Amoric«D< 
CUldcF^n 



1 Avhr, 
active J 
oliSQCh) 

2. Bactei 

reitrieu 
Oeasii 

•SBWA.2 

xooMic 
4. Brtfttbtt 

Ovilbuii 

ovomue< 
fi' Chs^ A 

£riiBh,S 

CMrt«h 

J. Rmttf 

8. CDhn^g 
motoottl 

Ntd. Acj 

9. C»MK,J 
Okici, W 
aod ehsr 
ias52U5< 

friSury a 

from Qjofl 

Knthltkd 

^ 1976. 
Cell 7:r5 
12. Efttrtttad 



. procedure ud the conplateneti of tb* cDNA 
copiBt of othGr ct&ular Beoes m in promts, 

0"f» other ipproMhes for clooioi 
CDNA t (and eenoaitf DSA as wvU) an deuly 
Itottted by the iWUty to d«t«ct partieultf 

S?™!?^' hybridiMtioii 
V6i 10, 31, 39 or functional ausyt (br tbe mRNA 
(16) lie tlia Belt meioA to Utnti^ specific 
cDNA dooat. The relatively fajgh yidds of 
cDNA'a whb a contpUie protein codiof 
quence oAn another aliematlve, naoiely, the 
id^tfleation or even aeleettve propigatloa of 
CDNA doQf* 00 the basis of the product or 
fiinetloa they express. By appropriete choice of 
the vcctor-pttmer and linker DNAi we ate ex- 
pionni tte conittttction of leeomblnants that 
m txpress the dooed cDNA'i direetly in either 
oacTerial or oianDMlian cells. 
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